Political Leaning Categorization by Exploring Subjectivities in Political Blogs

نویسندگان

  • Maojin Jiang
  • Shlomo Argamon
چکیده

This paper addresses a relatively new text categorization problem: classifying a political blog as either ‘liberal’ or ‘conservative’, based on its political leaning. Instead of simply using “Bag of Words” features (BoW) as in previous work, we have explored subjectivity manifested in blogs and used subjectivity information thus found to help build political leaning classifiers. Specifically, our subjectivity based approach is two fold: 1) we identify subjective sentences that contain at least two strong subjective clues based on the General Inquirer dictionary; 2) from subjective sentences identified, we extract opinion expressions and BoW features to build political leaning classifiers. Experiments with a political blog corpus we built show that by using features from subjective sentences can significantly improve the classification performance. In addition, by extracting opinion expressions from subjective sentences, we are able to reveal opinions that are characteristic of a specific political orientation to some extent.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Political Blogs and Their Political Leanings

The Blogosphere has more influence on both the general public’s opinions and mainstream media nowadays. As vast political blogs representing grassroots voices go online, it is both interesting and useful to find them and determine their political leanings as either liberal or conservative. In this paper, we address both problems by using front pages of blogs in a corpus we built to create two c...

متن کامل

Preliminary Semantic Analysis of Political Blogs

In this paper, we present a series of semantic analyses of words in political blogs in the setting of categorization of two opposite political orientations: liberal vs. conservative. We classify nouns, verbs, adjectives and adverbs into semantic categories by using the General Inquirer dictionary. Then distributions of these categories and correlations among them are examined both within and be...

متن کامل

‘The Right to Information’: A Malaysian Political Blog Readers’ Perspective

Political blogs are one of the pivotal alternative communication channels for political news in Malaysia. Many have argued that the mushrooming of political blogs nurtures the effective realization of human rights in the country. The paper studies the ‘Malaysian political blog readers–human rights’ relationship by exploring these questions: Has traditional mainstream media become obsolete with ...

متن کامل

Shedding (a Thousand Points of) Light on Biased Language

This paper considers the linguistic indicators of bias in political text. We used Amazon Mechanical Turk judgments about sentences from American political blogs, asking annotators to indicate whether a sentence showed bias, and if so, in which political direction and through which word tokens. We also asked annotators questions about their own political views. We conducted a preliminary analysi...

متن کامل

Blog Classification with Co-training

In this project we use co-training to classify blogs by political leaning. We classify them into two pre-defined categories: liberal and conservative. We examine the performance of co-training versus normal supervised learning. We also look at using several different features to improve training.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008